# Apple Chip Optimization

Internvl3 8B 6bit
Other
InternVL3-8B-6bit is a vision-language model converted to MLX format, supporting multilingual image-text-to-text tasks.
Image-to-Text Transformers Other
I
mlx-community
70
1
VL Rethinker 7B 6bit
Apache-2.0
This is a multimodal model based on Qwen2.5-VL-7B-Instruct, supporting visual question answering tasks, converted to MLX format for efficient operation on Apple chips.
Text-to-Image Transformers English
V
mlx-community
19
0
Smolvlm2 256M Video Instruct Mlx
Apache-2.0
This is a video-text-to-text model converted based on the MLX framework, suitable for video understanding and instruction-following tasks.
Image-to-Text Transformers English
S
mlx-community
591
7
Smolvlm2 500M Video Instruct Mlx
Apache-2.0
This is a video-text-to-text model based on the MLX format, developed by HuggingFaceTB, supporting English language processing.
Image-to-Text Transformers English
S
mlx-community
2,491
12
Qwen2.5 VL 3B Instruct MLX 8bits
This is an 8-bit quantized version of the Qwen2.5-VL-3B-Instruct model, optimized for the MLX framework and supports image-text generation tasks.
Image-to-Text Transformers English
Q
moot20
27
1
Whisperkit Coreml
WhisperKit is a local speech recognition framework specifically designed for Apple chips, offering efficient automatic speech recognition capabilities.
Speech Recognition Other
W
do-not-use-this-account-token
1,044
2
Coreml Stable Diffusion V1 4
Other
A latent diffusion-based text-to-image generation model capable of producing realistic images from text inputs, suitable for artistic creation and research purposes.
Text-to-Image
C
apple
230
29
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase